# Document image understanding

Paligemma Rich Captions

An image-captioning model based on PaliGemma-3b and fine-tuned on the DocCI dataset. It generates detailed descriptions of 200-350 characters with reduced hallucination.

Transformers English

Donut Base Finetuned Latvian Receipts V2

A model based on the Donut architecture, fine-tuned specifically on Latvian receipt data.

Text Recognition

Donut Base Finetuned Latvian Receipts

A version of donut-base fine-tuned on a Latvian receipt dataset, used primarily for receipt image processing.

Text Recognition

Donut Base Payslips

A document understanding model based on the Donut architecture, fine-tuned specifically for payslip image processing.

Text Recognition

© 2025 AIbase